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Abstract — To reduce computational complexity and delay in 
randomized network coded content distribution, and for some 
other practical reasons, coding is not performed simultaneously 
over all content blocks, but over much smaller, possibly overlap- 
ping subsets of these blocks, known as generations. A penalty of 
this strategy is throughput reduction. To analyze the throughput 
loss, we model coding over generations with random generation 
scheduling as a coupon collector's brotherhood problem. This 
model enables us to derive the expected number of coded packets 
needed for successful decoding of the entire content as well 
as the probability of decoding failure (the latter only when 
generations do not overlap) and further, to quantify the tradeoff 
between computational complexity and throughput. Interestingly, 
with a moderate increase in the generation size, throughput 
quickly approaches link capacity. Overlaps between generations 
can further improve throughput substantially for relatively small 
generation sizes. 
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I. Introduction 



A. Motivation: Coding over Disjoint and Overlapping Gener- 
ations 

Random linear network coding was proposed in HI for 
"robust, distributed transmission and compression of infor- 
mation in networks". Subsequently, the idea found a place 
in a peer-to-peer(P2P) file distribution system Avalanche (2] 
from Microsoft. In P2P systems such as BitTorrent, content 
distribution involves fragmenting the content at its source, 
and using swarming techniques to disseminate the fragments 
among peers. Systems such as Avalanche, instead, circulate 
linear combinations of content fragments, which can be gen- 
erated by any peer. The motivation behind such a scheme is 
that, it is hard for peers to make optimal decisions on the 
scheduling of fragments based on their limited local vision, 
whereas when fragments are linearly combined at each node, 
topology diversity is implanted inherently in the data flows 
and can be exploited without further co-ordination. 
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The introduction of network coding in P2P content distri- 
bution systems brings about the issue of computational com- 
plexity. Consider distributing a file consisting of TV fragments, 
each made up of d symbols from a Galois field GF(q) of 
size q. It takes 0{Nd) operations in GF(q) to form a linear 
combination per coded packet, and 0(N 3 + N 2 d) operations, 
or, equivalently, 0(N 2 + Nd) operations per information 
packet, to decode the information packets by solving linear 
equations. According to the implementers of UUSee 0, a 
peer-to-peer video streaming application using randomized 
linear coding, even with the most optimized implementation, 
going beyond 512 fragments in each generation risks taxing 
a low-end CPU, typically used in power-efficient notebook 
computers. 

In an effort to reduce computational complexity, information 
packets are partitioned into disjoint subsets referred to as 
generations, and coding is done only within generations. This 
approach scales down the encoding and decoding problem 
from the whole file size 7Y to the generation size times the 
number of generations. The concept of generation in network 
coding was first proposed by Chou et al. in (4| to handle 
the issue of network synchronization. Coding over randomly 
scheduled generations was first theoretically analyzed by May- 
mounkov et al. in 0. Random scheduling of generations 
provides the "rateless" property which reduces the need for 
receiver feedback and offers resilience to various erasure 
patterns over the communication link. In addition, in the peer- 
to-peer content distribution setting, random scheduling is to 
some degree a good approximation when global co-ordination 
among peers is impractical. 

With random scheduling of generations, coded packets ac- 
cumulate faster in some generations than in others, even if all 
generations are scheduled equally probably. While waiting for 
the last generation to become decodable, redundant packets are 
accumulated in other generations. The situation is aggravated 
as the generation size decreases. One way to recover some 
of the throughput loss due to random scheduling without 
losing the benefits of reduced generation sizes is to allow 
generations to help each other in decoding. If the generations 
are allowed to overlap, after some of the "faster" generations 
are decoded, the number of unknown variables can be reduced 
in those generations sharing information packets with the 
decoded ones, which in turn reduces the number of coded 
packets needed to decode those generations, and enhances the 
throughput as a result. Our goal is to characterize the effects of 
generation size and overlaps on the throughput and complexity 
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in randomized linear network coding. 

B. Related Work 

The performance of codes with random scheduling of 
disjoint generations was first theoretically analyzed in by 
Maymounkov et al., who referred to them as chunked codes. 
Chunked codes allow convenient encoding at intermediate 
nodes, and are readily suitable for peer-to-peer file dissem- 
ination. In 0, the authors used an adversarial schedule as 
the network model and characterized the code performance 
under certain restrictions on the chunk(generation) size when 
the length of the information to be encoded tends to infinity. 

Coding with overlapping generations was first studied in 
(6) and Q with the goal to improve throughput. Reference 
JT) studied a "head-to-toe" overlapping scheme in which 
only contiguous generations overlap for a given number of 
information packets, and analyzed its asymptotic performance 
over a line network when the length of information goes to 
infinity. Another overlapping scheme with a grid structure was 
proposed in (6j, analyzed for short lengths (e.g., 4 generations) 
and simulated for practical lengths. When properly designed, 
these codes show improved performance over codes with 
disjoint generations. In our work, we offer an analysis of 
coding over disjoint and overlapping generations for finite but 
practically long information lengths. 

C. Organization and Main Contribution 

In this work, coding with both disjoint and overlapping 
generations together with random generation scheduling is 
studied from a coupon collection |8] perspective. Previously 
existing results from the classical coupon collector's problem, 
along with our extensions, enable us to characterize the code 
performance with finite information lengths, from which the 
asymptotic code performance can further be deduced. 

Section [II] introduces the general model for coding over 
generations, disjoint or overlapping, over a unicast (binary 
erasure) link, and characterizes the computational cost for 
encoding and decoding. 

Section [III] derives several results concerning linear inde- 
pendence among coded packets from the same generation. 
Such results serve to link coupon collection to the decoding 
of content that has been encoded into multiple generations. 
Included (Claim Q]) is a very good upper bound on the 
distribution of the number of coded packets needed for a 
specific generation for successful decoding. 

Section [IV] introduces the coupon collector's brotherhood 
problem and its variations that can be used to model coding 
over generations. Probability generating functions (Theorems 
[2] and HI and moments (Corollaries [3] and [5]) of the number of 
samplings needed to collect multiple copies of distinct coupons 
are derived for the random sampling of a. finite set of coupons 
in Section IIV-AI Relevant asymptotic results on expected 
values and probability distributions in existing literature are 
recapitulated in Section llV-Bl for performance characterization 
of coding over generations in the later part of the work. The 
section is presented in the coupon collection language and is 



in itself of independent interest for general readers interested 
in coupon collecting problems. 

In Sections [Vland [vn results from the previous two sections 
are combined to enable the analysis of the effects of generation 
size and overlaps on the decoding latency /throughput of coding 
over disjoint or overlapping generations. 

Section [V] studies the effects of generation size on the code 
throughput over a BEC channel for coding over disjoint gen- 
erations. Section [V-AI characterizes the mean and variance of 
the decoding latency (the number of coded packets transmitted 
until successful decoding) for finite information lengths, and 
Section IV-BI provides a lower bound for the probability of 
decoding failure. A large gain in throughput is observed when 
the generation size increases from 1 to a few tens. 

In Section IVII the random annex code is proposed as 
an effort to improve code throughput by allowing random 
overlaps among generations. Section IVI-CI lists an algorithm 
providing precise estimation of the expected decoding latency 
of the random annex code. The algorithm is based on the 
analysis of the overlapping structure in Section IVI-BI and the 
results from the extended collector's brotherhood in Section 
II VI Section IVI-DI demonstrates the effects of overlap sizes 
on code throughput is shown through both numerical com- 
putation and simulations. One of our interesting observations 
is that overlaps between generations can provide a tradeoff 
between computational complexity and decoding latency. In 
addition, without increasing the generation size (and hence 
computational complexity), it is still possible to improve 
code throughput significantly by allowing overlaps between 
generations. 

II. Coding Over Generations: The General Model 

In this section, we describe a general random coding scheme 
over generations. Generations do not have to be disjoint or of 
equal size, and random scheduling of generations does not 
have to be uniform. We describe the coding scheme over a 
unicast link. 

A. Forming Generations 

The file being distributed T is represented as a set of N 
information packets, pi,P2, ■ ■ ■ ,Pn- Each information packet 
is a c?-dimensional column vector of information symbols in 
Galois Field GF(q). Generations are non-empty subsets of T. 

Suppose that n generations, G\, G2, ■ ■ ■ , G n , are formed s.t. 
T = U™ =1 Gj. A coding scheme is said to be non-overlapping 
if the generations are disjoint, i.e., Vi 7^ j, Gi fl Gj = 0; 
otherwise, the scheme is said to be overlapping. The size 
of each generation Gj is denoted by gj, and its elements 
p\ , p 2 , . . . ,Pgi ■ For convenience, we will occasionally also 
use Gj to denote the matrix with columns Pi,p^\ ■ ■ ■ ,Pg) ■ 

B. Encoding 

In each transmission, the source first selects one of the n 
generations at random. The probability of choosing generation 

Gi is p u Yn=iPi = !■ Let P = {Pi,P2, ■ ■ ■ ,Pn)- Once 
generation Gj is chosen, the source chooses a coding vector 
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e = [ei, e2, • • • , e gj] T , with each of the gj components chosen 
independently and equally probably from GF(q). A new 
packet p is then formed by linearly combining packets from 
Gj by e: p = YliLi e iPi = e 'Gj (Gj here denotes a matrix). 

The coded packet p is then sent over the communication 
link to the receiver along with the coding vector e and 
the generation index j. Figure Q] shows a diagram of the 
communication between the source and the receiver. The 
generations shown in this example are chosen to be disjoint, 
but this is not necessary. 
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Fig. 1. A file divided into TV = 12 fragments and n = 4 (disjoint) 
generations containing h = 3 fragments each is available for distribution 
at the server. A receiver collects random linear combinations of randomly 
scheduled generations. 



C. Decoding 

Decoding starts with any generation Gj for which the 
receiver has collected gj coded packets with linearly inde- 
pendent coding vectors. The information packets making up 
this generation are decoded by solving a system of gj linear 
equations in GF(q) formed by the coded packets on one side 
and the linear combinations of the information packets by the 
coding vectors on the other. Since generations are allowed to 
overlap, a decoded information packet may also participate in 
other generations, from the equations of which the information 
packet is then removed as an unknown variable. Consequently, 
in all the generations overlapping with the decoded gener- 
ations, the number of unknown packets is reduced. As a 
result, some generations may become decodable even if no 
new coded packets are received from the source. Again, the 
newly decoded generations resolve some unknowns of the 
generations they overlap with, which in turn may become 
decodable and so on. We declare successful decoding when 
all N information packets have been decoded. 

The coding scheme described here is inherently rateless 
and easily extendable to more general network topologies that 
allow coding at intermediate network nodes. 

D. Packet Overhead 

Contained in each coded packet are the index of a generation 
Gj and a linear combining vector for Gj which together 



take up [log 2 n] + gj |~log 2 q] bits. Meanwhile, the data in 
each coded packet comprise d|"log 2 <7] bits. The generation 
size makes a more significant contribution to packet overhead 
and such contribution is non-negligible due to the limited size 
(~ a few KB) of transmission packets in practical networks. 
This gives another reason to keep generations small, besides 
reducing computational complexity. 

E. Computational Complexity 

The computational complexity for encoding is 
0(dma,x{gj}) per coded packet for linearly combining 
the gj information packets in each generation (recall that d is 
the number of GF(q) symbols in each information packet, 
as defined in Section Ill-Al l. For decoding, the largest number 
of unknowns in the systems of linear equations to be solved 
is not more than ma,x{gj}, and therefore the computational 
complexity is upper bounded by 0((max{gj}) 2 + d ma.x{gj}) 
per information packet. 

F. Decoding Latency 

In this paper, we focus on the tradeoff between the computa- 
tional complexity and the decoding latency of these codes over 
unicast links with erasures. Decoding latency here is defined 
as the number of coded packets transmitted until successful 
decoding of all the information packets, and overhead is the 
difference between the number of information packets and 
the decoding latency. We assume a memoryless BEC with a 
constant erasure rate e. Since our coding scheme is rateless, 
each coded packet is statistically of the same importance, and 
so the average decoding latency is inversely proportional to 
the achievable capacity (1 — e) of the link. The throughput of 
the code is inversely proportional to the decoding latency for 
given information length. 

III. Collecting Coded Packets and Decoding 

A generation Gi is not decodable until the number of 
linearly independent equations collected for Gi reaches the 
number of its information packets not yet resolved by decoding 
other generations. The connection between the number of 
coded packets collected and the linear independence among 
these coded packets has to be established before we can 
predict the decoding latency of codes over generations using 
the collector's brotherhood model that will be discussed in the 
next section. 

Let M(g, x) be the number of coded packets from a gener- 
ation of size g adequate for collecting x linearly independent 
equations. Then M(g,x) has expected value J9] 



(i) 
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Approximating summation by integration, from (Q]) we get 
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Let 



Vg(x) 



qX-l-g 

1 - q x - 1 -a 



l°g 



(3) 



We can use r) g (x) to estimate the number of coded packets 
needed from a certain generation to gather x linearly indepen- 
dent equations. 

In addition, we have the following Claim Q] which upper 
bounds the tail probability of M(g,g), the number of coded 
packets needed for a certain generation to gather enough 
linearly independent equations for decoding. 

Claim 1: There exist positive constants a q . g and «2,oo such 
that, for s > g, 

9-1 



Prob[M(g,g) > s] = 1 - JJ(1 - q k ~ s ) 



k=0 



< 1 - exp(-a qtg q {s 9) ) < 1 - cxp(-a 2 ,oo<7 
Also, since 1 — exp(— x) < x for x > 0, 

Pmb[M{g,g) > s] < a , 3 g- (s - 9) . 



-(s-fl)l 



(4) 



Proof: Please refer to Appendix lAl ■ 
We will use Claim[T]in Theorem[8]in Section[V]to derive an 
upper bound to the expected overhead of coding over disjoint 
generations. 

IV. Coupon Collector's Brotherhood and 
Collecting Coded Packets from Generations 

The coupon collector's brotherhood problem iflOll . ifTTl 
studies quantities related to the completion of to sets of n 
distinct coupons by sampling a set of n distinct coupons 
uniformly at random with replacement. In analogy, coded 
packets belonging to generation j can be viewed as copies of 
coupon j, and hence the process of collecting coded packets 
when generations are scheduled uniformly at random can be 
modeled as collecting multiple copies of distinct coupons. 

Because of possible linear dependence among coded packets 
and the overlaps between generations, the numbers of coded 
packets needed for each of the n generations to ensure suc- 
cessful decoding, however, are n random variables. Therefore, 
we must generalize the coupon collector's brotherhood model 
from collecting a uniform number of copies for all coupons to 
collecting different numbers of copies for different coupons, 
before it can be applied to the analysis of the throughput 
performance of coding over generations. In this section, the 
original collector's brotherhood model is generalized in two 
ways. And later in this paper, the analysis of the throughput 
performance of coding over disjoint generations in Section [V] 
rests on the first generalization, whereas that of coding over 
overlapping generations in Section [VI] rests on the second 
generalization. As our results are of more general interest than 
the coding-over-generations problem, we will express them in 
the coupon collection language. For example, the probability 
Pi of scheduling generation Gi (defined in Section HH here 
refers to the probability of sampling a copy of coupon Gi, for 
i = 1, 2, . . . , n. 



A. Generating Functions, Expected Values and Variances 
For any to 6 N, we define S m (x) as follows: 



s m (x) =1 + Yi + |r 



™m— X 



(m - 1)! 

S m (x) =0 (to < 0) and S^x) = e x . 



(to > 1) (5) 
(6) 



Let the total number of samplings needed to ensure that 
at least m,(> 0) copies of coupon Gi are collected for all 
i = 1, 2, . . . ,n be T(p, m), where m = (toi, to.2, . . . , m n ). 
The following Theorem [2] gives <PT(p,m)( z ), the generating 
function of the tail probabilities of T(p,m). This result 
is generalized from iflOl and IfTTl . and its proof uses the 
Newman-Shepp symbolic method in IflOl . Boneh et al. lTT2ll 
gave the same generalization, but we restate it here for use in 
our analysis of coding over disjoint generations. If for each 
j = 1,2,,.,, n, the number of coded packets needed from 
generation Gj for its decoding is known to be rrij (which 
can be strictly larger than the generation size gf), T(p, m) 
then gives the total number of coded packets needed to ensure 
successful decoding of the entire content when the generations 
are scheduled according to the probability vector p. 

Theorem 2: (Non-Uniform Sampling) Let 



Then. 



(7) 



(8) 



f°° r - 

/ [e-^ 1 -*) - J] [e-P^ 1 -*) - S mi [p t xz)e-^] )dx. 
^° i=i 

Proof: Please refer to Appendix [B] where we give a 
full proof of the theorem to demonstrate the Newman-Shepp 
symbolic method IflOl . which is also used in the proof of our 
other generalization in Theorem [4] ■ 

The expected value and the variance of T(p, m) follow 
from the tail probability generating function derived in Theo- 
rem |2] 

Corollary 3: 



E[T(p,m)]=<p T{p>Bl) (l) 



x)e 



dx, 



Var[T(p,m)] = 2ip' T{p>m) {l) + <Pt M (1) - <4( P ,m)(l)- 

Proof: Please refer to Appendix [B] ■ 
Note that in Theorem [2] and Corollary [5J rrii-s are allowed 
to be 0, thus including the case where only a specific subset 
of the coupons is of interest. Theorem [2] and Corollary [3] are 
also useful for the analysis of coding over generations when 
there is a difference in priority among the generations. For 
instance, in layered coded multimedia content, the generations 
containing the packets of the basic layer could be given 
a higher priority than those containing enhancement layers 
because of a hierarchical reconstruction at the receiver. 

In the following, we present another generalization of the 
collector's brotherhood model. Sometimes we are simply 
interested in collecting a coupon subset of a certain size, 
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regardless of the specific content of the subset. This can be 
further extended to the following more complicated case: for 
each i = 1, 2, . . . , A{A > 1), ensure that there exists a subset 
of {Gi, G2, ■ ■ ■ , G n } such that each of its ki elements has at 
least rrii copies in the collected samples. Such a generalization 
is intended for treatment of coding over equally important 
generations, for example, when each generation is a substream 
of multiple-description coded data. In this generalization, 
the generation scheduling (coupon sampling) probabilities are 
assumed to be uniform, i.e., pi = p2 = ■ ■ ■ = Pn = l/n. 

Suppose that for some positive integer A < n, integers 
kit . ■ . , kA and mi, . . . , mj satisfy 1 < kx < ■ ■ ■ < kA < n 
and 00 ~ too > m i > • • • > rriA > rriA+i = 0. We are 
interested in the total number U(m, k) of coupons that needs 
to be collected, to ensure that the number of distinct coupons 
for which at least nij copies have been collected is at least 
ki, for all i = 1,2,..., A, where m = (mi, rri2, • • • , itia) 
and k = (fci, fc2, . . . , fc^). The following Theorem [4] gives the 
generating function (pu(m,k)( z ) of E/(m,k). 

Theorem 4: (Uniform Sampling) 



>(*) 





ij+i 
i 



(9) 



"} 



dx. 



e n 

(zq,z 1 ,...,z a + 1 ): j = Q 
j = l,2,--,A 

Proof: Please refer to Appendix iBl ■ 
Same as for Corollary [3] we can find E[U (m,'k)] = 
< P(7(m,k) (!)■ A computationally wieldy representation of 
i?[[/(m,k)] is offered in the following Corollary [3J in a 
recursive form. 

Corollary 5: For k = hi, k\ + 1, . . . , n, let 

(j>0,k{x) = [(S mo (x) - S mi (x))e- X ] k ; 

For j = 1,2,..., A, let 



k ) [(S mj (x) - S m]+1 {*))z- x f 



<t>j,k{ x ) 

k 

= E 

w—k 

for k = fcj+i, kj + i + 1, . . 
Then, 

E[U(m,k)} 



OO 

1 I (1 





4>A,n{x)) dx. 



(10) 



It is not hard to find an algorithm that calculates 1 — <pA,n(x) 

in (cimi + c 2 (n-l)+c 3 Y,f=i YZ=k 3+1 ( k ~ k ])) basic arith " 
metic operations, where c\, C2 and C3 are positive constants. As 
long as toi = 0(An 2 ), we can estimate the amount of work 
for a single evaluation of 1 — <pA,n{x) to be 0(An 2 ). The 
integral ( TTOb can be computed through the use of an efficient 
quadrature method, for example, Gauss-Laguerre quadrature. 
For reference, some numerical integration issues for the special 
case where A = 1 have been addressed in Part 7 of [ 1 3 1 and 

m ma. 

In Section IVI1 we will apply Corollary [5] to find out the 
expected throughput of the random annex code, an overlapping 



coding scheme in which generations share randomly chosen 
information packets. The effect of the overlap size on the 
throughput can be investigated henceforth. 

B. Limiting Mean Value and Distribution 

In the previous subsection, we considered collecting a finite 
number of copies of a coupon set of a finite size. In this part, 
we present some results from existing literature on the limiting 
behavior of T(p, m) as n ^ 00 or ?ni = TO2 = • • • = m n = 
vn — ^ 00, assuming p\ = p% = • • • = p n = h- By slight abuse 
in notation, we denote T(p,m) here as T n (m). 

By Corollary E] 

p OO 

E[T n (m)] =n [l - (1 - S m (x)e- X ) n ] dx. (11) 
Jo 

The asymptotics of E[T n (m)] for large n has been discussed 
in literature iflOl . Ifl4l and 031, and is summarized in the 
following Theorem [6] (fTST l, and Theorem [7] 

Theorem 6: (|[R)) When n — > 00, 

E[T n (m)] = n\ogn + (m — l)n loglog n + C m ?i + o(n), (12) 

where C m = 7 — log(m — 1)!, 7 is Euler's constant, and 
m e N. 

For to 3> 1, on the other hand, we have ifTOl 



E[T n (m)] —> nm. 



(13) 



What is worth mentioning is that, as the number of coupons 
n — > 00, for the first complete set of coupons, the number 
of samplings needed is O(nlogn), whereas the additional 
number of samplings needed for each additional set is only 
C(nlog logn). 

In addition to the expected value of T n (m), the concen- 
tration of T n (m) around its mean is also of great interest to 
us. This concentration leads to an estimate of the probability 
of successful decoding for a given number of collected coded 
packets. We can specialize Corollary [3] to derive the variance 
of T n (m), as a measure of probability concentration. 

Further, since the tail probability generating functions de- 
rived in the last subsection are power series of non-negative 
coefficients and are convergent at 1, they are absolutely 
convergent on and inside the circle \z\ = 1 in the complex 
z-plane. Thus, it is possible to compute the tail probabilities 
using Cauchy's contour integration formula. However, extra 
care is required for numerical stability in such computation. 

Here we instead look at the asymptotic case where the 
number of coupons n —> 00. Erdos and Renyi have proven 
in lfl6l the limit law of T n (m) as n — > 00. Here we restate 
Lemma B from lfl4l by Flatto, which in addition expresses 
the rate of convergence to the limit law. We will later use this 
result to derive a lower bound for the probability of decoding 
failure in Theorem [9] in Section [V-BI 

Theorem 7: (031) Let 

in(w) = — (T n (m) — nlogn — (to — l)n log logn) . 



Then, 

Pr[Y n (m) < y] = cxp ( - 



(m - 1)! 



O 



log log n N 
logn . 
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Remark 1: (Remarks 2&3, Ifl4"l ) The estimation in Theorem 
|7] is understood to hold uniformly on any finite interval —a < 
y < a. i.e., for any a > 0, 



Prob [Y n (m) < y] — exp 



ex P(~y) 
"(m- 1)! 



< C(m,a) 



log log n 

log n ' 



n > 2 and —a < y < a. C(m, a) is a positive constant 
depending on m and a, but independent of n. For m = 1, the 
convergence rate to limit law is much faster: the O ' log log - 



log n 



term becomes O 



log n 



V. Coding Over Disjoint Generations 

In this section, we study the performance of coding over 
disjoint generations. We derive both an upper bound and a 
lower bound for the expected decoding latency (as defined 
in Section Hl-Fl i. We also derive the variance of the decoding 
latency. 



A. Expected Decoding Latency and Its Variance 

Let Mi (i = 1,2, ...,n) be the number of collected 
coded packets from generation Gi when G, first becomes 
decodable. Then Mj is at least gi, has the same distribution as 
M(gi,gi), the number of coded packets needed for a certain 
generation to gather enough linearly independent equations 
for decoding, as defined and studied in Section Hill M^'s are 
independent random variables. Let the decoding latency over a 
perfect channel be W(p, g), where g = (g±,g2, . . . , g n )- Use 
W e (p, g) to denote the decoding latency on a BEC(e). 

Let Xh (k = 1, 2, . . . ) be i.i.d. geometric random variables 
with success rate 1 — e. Therefore, -ELY^.] = and E[X 2 ] = 

w e ( P ,e)= ]T x it 



and therefore, 

E\W e (p,&)] 
Var[W.(p,g)} 



1 - e 
1 



E[W(p,s)], 



(14) 



(1-ef 



(Var[W(p, S )]+eE[W 2 (p,g)} 



(15) 



By definition, E[W(p, g)] is lower bounded by E[T(p, g)], 
the expected number of coded packets necessary for collect- 
ing at least <?.; coded packets for each generation Gi, and 
E[T(p,g)] is as given in Corollary [3] 

The following Theorem [8] gives the exact expression for the 
first and second moments of W(p, g), along with an upper 
bound for E[W(p, g)] considering the effect of finite finite 
field size q. Then, the expected value and the variance of 
W e (p, g) can be derived from (TBI and (TT~5T >. 

Theorem 8: The expected number of coded packets needed 
for successful decoding of all N information packets 



E[W(p,g)} 

f°° ( J l _ \ 

= y o ( 1 - II i 1 - e ~ PzXE *h [S Mi (Pix)])j dx (16) 

< r ( 1 - TT (l - e-»* x (S gi ( Pi x) (17) 
Jo \ i=1 v 

+ a g , gi q 9i e" iX /« - a q , gi q 9i S m ( Pi x/q)) ^jdx, 

E[W 2 (p,g)} (18) 

Jo X \ Pl 1 - E Mi [S Mt (Pix)] e-H* 

n v 
U (1 - [S Mj ( Pj x)] e-^ x ) )dx 

3=1 ' 

poo f n \ 

+ /, ( 1 - II ( X - [SmMx)])] dx 

where a q:9z = - ^feS) 1 ln i 1 _ 9 fe_9i ) , « = 1, 2, . . . , n. 

Proof: Please refer to Appendix [C] ■ 

In the case where generations are of equal size and sched- 
uled uniformly at random, we can estimate the asymptotic 
lower bound for E[W(p,g)] by the asymptotics of T n (m) 
given in (fl2l) and dT3l l. 

Figure | ^Ia)1 shows several estimates of E[W(p, g)], and Fig- 
ure |Jb)] shows the standard deviation of W(p, g) calculated 
from Theorem [8] and simulation results, when pi = — and 
<7j = g for i = 1, 2, . . . , n. The estimates are plotted versus 
the uniform generation size g for fixed N = ng = 1000. 

For coding over disjoint generations and a fixed total num- 
ber of information packets, both the expected value and the 
standard deviation of the decoding latency drop significantly 
as the generation size g grows to a relatively small value from 
the case where no coding is used (g = 1). Hence, throughput 
is improved by a moderate increase in the computational cost 
that scales quadratically with the generation size (see Section 
III-Eb . On the other hand, we also observe that past a moderate 
generation size (~ 50 — 100 coded packets for N = 1000), 
the decrease in decoding latency becomes slower by further 
increasing the encoding/decoding complexity. We therefore 
argue for a "sweet spot" generation size which characterizes 
the tradeoff between throughput and complexity. 

B. Probability of Decoding Failure 



In this subsection we assume uniform generation size and 
scheduling probability, i.e., pi = —, gi = g for i = 1, 2, . . . , n. 
For short, we denote W(p, g) as W n (g). From Theorem [7] 
we obtain the following lower bound to the probability of 
decoding failure as n — > oo: 

Theorem 9: When n — > oo, the probability of decoding 
failure when t coded packets have been collected is greater 



than 1 — cxp 



■^yynQogn) 9 1 exp(- 



■O 



( log log Tt ^ 
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Fig. 2. (a) Estimates of E[W(p, g)], the expected number of coded packets 
required for successful decoding when the total number of information packets 
is N = 1000, and both g and p are uniform. Estimates shown: lower bound 
E[T(p, g)]; upper bound (T7J; mean of W(p, g) in simulation; n — > oo 
asymptotic (12); m 3> 1 asymptotics )13> ; (b) Estimates of the standard 
deviation of W(p, g); (c) Estimates of probability of decoding failure versus 
the number of coded packets collected: Theorem [9] along with simulation 
results. 



Proof: The probability of decoding failure after acquiring 
t coded packets equals Prob[W„(<7) > t]. Since W n (g) > 

T n (g), 

Prob[W„( 5 ) > t] > Prob[T„(g) > t] 



1 -Prob 



Y n (g) < logn- (g - l)loglogn 



The result in Theorem [9] follows directly from Theorem [7] ■ 
Corollary 10: When g is fixed and n —> oo, in order 
to make the probability of decoding failure smaller than S, 
the number of coded packets collected has to be at least 
E[T n (g)] — nloglogy^j. If 6 = for some constant c, 
then the number of coded packets necessary for successful 
decoding has to be at least E[T n (g)] + cn\og(ng). 

Theorem 4.2 in J5) also gives the number of coded packets 
needed to have the probability of decoding failure below 5 = 
■Ax, but under the assumption that \n(N/6) = o(N/n) = o{g). 
In comparison, Corollary[l0]treats the case where g is constant. 

Figure | 3tc)| shows the estimate of the probability of decod- 
ing failure versus T, the number of coded packets collected. 
As pointed out in Remark Q] for m > 2, the deviation of 
the CDF of T n (m) from the limit law for n — » oo depends 
on m and is on the order of 0( lo f o 1 g °g" ) for m > 2, which 
is quite slow, partly explaining the deviation of the limit law 
curves from the simulation curves for m = 5 and m = 10 in 
Figure | 2Jc)[ 

VI. Coding Over Overlapping Generations 

Even when generations are scheduled uniformly at random, 
there will be more coded packets accumulated in some of 
the generations than in others. The "slowest" generation is 
the bottleneck for file decoding. It is then advisable to design 
a mechanism that allows "faster" generations to help those 
lagging behind. In this section, we propose the random an- 
nex code, a new coding scheme in which generations share 
randomly chosen packets, as opposed to previously proposed 
"head-to-toe" overlapping scheme of Q. 

We provide a heuristic analysis of the code throughput 
based on our results for the coupon collection model and an 
examination of the overlapping structure. Previous work on 
coding over overlapping generations, J6) and Q, lacks accu- 
rate performance analysis for information blocks of moderate 
finite lengths. On the other hand, the computational effort 
needed to carry out our analysis scales well with the length 
of information, and the performance predictions coincide with 
simulation data. In addition, we find that our random annex 
code outperforms the "head-to-toe" overlapping scheme of Q 
over a unicast link. 

In this section we conveniently assume that the coded 
packets are sent over a perfect channel, since here we are 
interested in comparing the performance of different rateless 
coding schemes. 



A. Forming Overlapping Generations 

We form n overlapping generations out of a file with N 
information packets in two steps as follows: 
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1) Partition the file set T of N packets into subsets 
Bi, B2, ■ ■ ■ , B n , each containing h consecutive packets. 
These n = N/h subsets are referred to as base gener- 
ations. Thus, Bi = {p(i-i)h+i,P(i-i)h+2,---,Pih} for 
i = 1, 2, . . . , n. N is assumed to be a multiple of h for 
convenience. In practice, if N is not a multiple of h, set 
n = \N /h~\ and assign the last [N — (n — l)h] packets 
to the last (smaller) base generation. 

2) To each base generation Bi, add a random annex Ri, 
consisting of I packets chosen uniformly at random 
(without replacement) from the N — h = (n — l)h 
packets in J-\B{. The base generation together with its 
annex constitutes the extended generation d = BiURi, 
the size of which is g = h + I. Throughout this 
paper, unless otherwise stated, the term "generation" will 
refer to "extended generation" whenever used alone for 
overlapping generations. 

The generation scheduling probabilities are chosen to be 
uniform, p\ = pi = ■ ■ ■ = p n = 1/ n. The encoding and 
decoding procedures run the same as described in the general 
model in Section HI] 

B. Analyzing the Overlapping Structure 

The following Claims [TT] through [14] present combinatorial 
derivations of quantities concerning the frequency at which 
an arbitrary information packet is represented in different 
generations. 

Claim 11: For any packet in a base generation Bk, the 
probability that it belongs to annex R r for some r G 
{l,2,...,n}\{fc} is 

'N - h - 1\ (N - h\ _ I I 
l-l )'\ I J ~ N -h~ (n-l)Zi' 
whereas the probability that it does not belong to R r is ff = 

1 - TT. 

Claim 12: Let X be the random variable representing the 
number of generations an information packet participates in. 
Then, X = 1 + Y, where Y is Binom(n — 1, it). 

£7[X] = l + (n-l)7r=l + |, 



and 



Var[X] = (n- \)tttt. 



Claim 13: In each generation of size g = h+l, the expected 
number of information packets not participating in any other 
generation is /i7f(™ -1 ) m he~ l / h for n 3> 1; the expected 
number of information packets participating in at least two 
generations is 



I + h[l - Tft"- 1 ) 
for n > 1 and I > 0. 



l + h 



1 



,-l/h 



< mm{g, 21} 



The following Theorem [T5l quantifies the expected amount 
of help a generation may receive from previously decoded 
generations in terms of common information packets. In the 
next subsection, we use Corollary |5] and Theorem [15] for a 
heuristic analysis of the expected throughput performance of 
the random annex code. 

Theorem 15: For any I C {1, 2, . . . , n} with |/| = s, and 
any j e {1, 2, . . . , n}\I, 

fl(s) = E[\ (Ui e /Gi) n Gjl] =g-[l- n s ] + sh ■ tttt s (19) 

where \B\ denotes the cardinality of set B. When n —> 00, 
if I; -J- a and | ->• 8, and let w(J3) = f2(s), then u)(0) ->■ 
h [(1 + a) (1 - e- Q/3 ) + a/3e- Q ' 3 ] . 

Proof: Please refer to Appendix ID] ■ 

C. Expected Throughput Analysis: The Algorithm 

Given the overlapping structure, we next describe an analy- 
sis of the expected number of coded packets a receiver needs 
to collect in order to decode all N information packets of F 
when they are encoded by the random annex code. We base 
our analysis on Theorem [TBI above. Corollary [5J in Section HVI 
and also ([3]) in Section [Till an d use the mean value for every 
quantity involved. 

By the time when s (s = 0, 1, . . . , n — 1) generations have 
been decoded, for any one of the remaining (n— s) generations, 
on the average fi(s) of its participating information packets 
have been decoded, or equivalently, (g — f2(s)) of them are 
not yet resolved. If for any one of these remaining generations 
the receiver has collected enough coded packets to decode 
its unresolved packets, that generation becomes the (s + l)th 
decoded; otherwise, if no such generation exists, decoding 
fails. 

The quantity r/ g (x) defined in (f3]) in SectionlTXIIestimates the 
number of coded packets from a generation of size g adequate 
for collecting x linearly independent equations. By extending 
the domain of rj g (x) from integers to real numbers, we can 
estimate that the number of coded packets needed for the (s + 
l)th decoded generation should exceed m' s = \r/ g (g — f2(s))] • 
Since in the random annex code, all generations are randomly 
scheduled with equal probability, for successful decoding, we 
would like to have at least m' coded packets belonging to one 
of the generations, at least belonging to another, and so 
on. Then Corollary [3J in Section [IV] can be applied to estimate 
the total number of coded packets needed to achieve these 
minimum requirements for the numbers of coded packets. 

The algorithm for our heuristic analysis is listed as follows: 

1) Compute 0(s — 1) for s = 1, . . . , n using Theorem [TBI 

2) Compute m' s = \r/ g (g — 0(s — 1))] for s = 1,2, . . . , n 
using (0; 

3) Map m' s (s = 1,2, ...,n) into A values m,j (j = 



1, 2, . . . , A) so that rrij 



'fe,-_i+i 



l fcj-i+2 



for j = 1,2, . . . , A, ko = and ItA = n; 



Claim 14: The probability that two generations overlap is 4) Evaluate (QJJ in Corollary [5J with the A, kjS, and rrijS 



I N-2h 
\l.l,N-2h-2l 



)/( 



N-h\ 



The number of generations over- 



lapping with any one generation Gi is then 



Binom 



1. 



1 - 



N — 2h 
l,l,N-2h-2l 



N-k 



obtained in StepO, as an estimate for the expected num- 
ber of coded packets needed for successful decoding. 
Remark 2: The above Step [3]) is viable because fi(s) is 
nondecreasing in s, f] g (x) is non-decreasing in x for fixed g, 
and thus m' 3 is non-increasing in s. 
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Although our analysis is heuristic, we will see in the next 
section that the estimate closely follows the simulated average 
performance curve of the random annex coding scheme. 

D. Numerical Evaluation and Simulation Results 

1 ) Throughput vs. Complexity in Fixed Number of Genera- 
tions Schemes: Our goal here is to find out how the annex size 
I affects the decoding latency of the scheme with fixed base 
generation size h and the total number of information packets 
N (and consequently, the number of generations n). Note 
that the generation size g = h + I affects the computational 
complexity of the scheme, and hence we are actually looking 
at the tradeoff between throughput and complexity. 

Figure [3] shows both the analytical and simulation results 
when the total number N of information packets is 1000 and 
the base generation size h is 25. Figure |3 |{a)| shows h + l — 
f2(s) for s = 0,1,..., n with different annex sizes. Recall 
that fi(s) is the expected size of the overlap of the union of 
s generations with any one of the leftover n — s generations. 
After the decoding of s generations, for any generation not yet 
decoded, the expected number of information packets that still 
need to be resolved is then h + l — 0(s). We observe that the 
h + I — f2(s) curves start from h + I for s = and gradually 
descends, ending somewhere above h — I, for s = n — 1. 

Recall that we measure throughput by decoding latency 
(Section III-Fb . Figure | j{b)| shows the expected performance 
of the random annex code, along with the performance of the 
head-to-toe overlapping code and the non-overlapping code 
(I = 0). Figure \ $cj\ shows the probability of decoding failure 
of these codes versus the number of coded packets collected. 

• Our analysis for the expected decoding latency closely 
matches the simulation results. 

> Figure [ 3Jb)| shows that by fixing the file size N and the 
base generation size h, the expected decoding latency 
decreases roughly linearly with increasing annex size I, 
up to I — 12 for the random annex scheme and up to I = 8 
for the head-to-toe scheme. Meanwhile, the decoding 
cost per information packet is quadratic in g = h + I. 
Beyond the optimal annex size, throughput cannot be 
further increased by raising computational cost. 

• The random annex code outperforms head-to-toe over- 
lapping at their respective optimal points. Both codes 
outperform the non-overlapping scheme. 

• As more coded packets are collected, the probability of 
decoding failure of the random annex code converges to 
faster than that of the head-to-toe and that of the non- 
overlapping scheme. 

Overlaps provide a tradeoff between computational com- 
plexity and decoding latency. 

2) Enhancing Throughput in Fixed Complexity Schemes: 
Our goal here is to see if we can choose the annex size 
to optimize the throughput with negligible sacrifice in com- 
plexity. To this end, we fix the extended generation size 
g = h + l and vary only the annex size I. Consequently, the 
computational complexity for coding does not increase when 
I increases. Actually, since some of the information packets in 
a generation of size g could already be solved while decoding 
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Fig. 3. N = 1000, h = 25, q = 256: |(a)| Difference between the generation 
size and the expected size of overlap with previously decoded generations 
(h + l — Q(s)); [(b)! Expected number of coded packets needed for successful 
decoding versus annex size /;|(c)| Probability of decoding failure 



other generations, the remaining information packets in this 
generation can be solved in a system of linear equations of 
fewer than g unknowns, and as a result increasing I might 
decrease the decoding complexity. 

Figure 2] shows both the analytical and simulation results for 
the code performance when the total number N of information 
packets is fixed at 1000 and size g of extended generation fixed 
at 25. 

> Again our analytical results agree with simulation results 
very well; 

• It is interesting to observe that, without raising computa- 
tional complexity, increasing annex size properly can still 
give non-negligible improvement to throughput; 

• Figure | ^Ja)| shows a roughly linear improvement of 
throughput with increasing I, up to I = 10 for the random 
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Fig. 4. N = 1000, g = h + l = 25, q = 256:|(a)lExpected number of coded 
packets needed for successful decoding versus annex size I; |(b)| Probability 
of decoding failure 



annex scheme and up to I = 6 for the head-to-toe scheme. 
Increasing I beyond affects throughput adversely; 

• The random annex code again outperforms head-to-toe 
overlapping at their optimal points. Both codes outper- 
form the non-overlapping scheme; 

• We again observe that the probability of decoding failure 
of the random annex code converges faster than those of 
the head-to-toe and the non-overlapping schemes. 

When the overlap size increases, we either have larger 
generations with unchanged number of generations, or a larger 
number of generations with unchanged generation size. In both 
cases the decoding latency would increase if we neglected the 
effect of overlaps during the decoding process. If we make use 
of the overlap in decoding, on the other hand, the larger the 
overlap size, the more help the generations can lend to each 
other in decoding and, hence, reducing the decoding latency. 
Two canceling effects result in a non-monotonic relationship 
between throughput and overlap size. 

The effect of generation size on the throughput of random 
annex codes is further illustrated in Figure Figure |5]plots the 
optimal expected decoding latency achievable by random an- 
nex codes and the corresponding optimal annex size versus the 
generation size for N = 1000 and q = 16. The plotted values 
are calculated using the algorithm listed in Section IVI-CI We 
can see from Figure that with the random annex code and a 
generation size of 20, the expected throughput is better than 
what can be achieved with coding over disjoint generations 
and a generation size of 50. The reduction in computational 
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Fig. 5. Optimal expected decoding latency and the optimal overlap size with 
random annex codes. N = 1000, q = 16 



complexity is considerable. Capturing the optimal overlap size 
in terms of other parameters of the code is our object of interest 
in the future. 



Appendix A 
Proof of ClaimQ] 

For i = 1,2, ... ,n and any s > g, we have 
lnProb{A/( 5 , 5 ) < s} 

h-l 
= 1*11(1 



-.k — s 



9-1 

= 5>(i-«* _ ") 

fc=0 A:=0 

9—1 oo 00 1 9- 1 



k=0 j=l 



j=l J k=0 



j-l'-g) V 1 g -f.7-i)f»-fl) 1 ~g " 

1 1 - q~3S 
J 



7 1 



3=1 



=<r (s ~ 9) ]nVrob{M(g,g) < g) 
> g -( s -ff) lim lnProb{M(c/, g) < g] 

The claim is obtained by setting 

a q , g = -lnProb{M(,g,g) < g}, 

and 

a 2 ,oo = - lim lnProb{M(#,#) < g). 
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Appendix B 

Proofs of Generalized Results of Collector's 
Brotherhood Problem 

Proof of Theorem \2\ 

Our proof generalizes the symbolic method of iflOl . 

Let £ be the event that the number of copies of coupon Gi 
is at least m, for every i = 1,2, ... ,n. For integer t > 0, 
let £(t) be the event that £ has occurred after a total of t 
samplings, and let £(t) be the complementary event. Then, 
the tail probability Prob[T(p, m) > t] = Prob[f (t)] = v t . 

To derive v t , we introduce an operator / acting on 
an n-variable polynomial g. f removes all monomials 
x™ 1 x 2 2 ■ ■ . x'n n in g satisfying wi > mi,...,w n > m n . 
Note that / is a linear operator, i.e., if g\ and g 2 are two 
polynomials in the same n variables, and a and b two scalars, 
we have af( gi ) + bf(g 2 ) = f(agt + bg 2 ). 

Each monomial in (x± + - ■ ■+x n ) t corresponds to one of the 
n* possible outcomes of t samplings, with the exponent of Xi 
being the number of copies of coupon Gi. Since the samplings 
are independent, the probability of an outcome X™ 1 x 2 2 ■ ■ ■ x^ n 
is • ••<**■ Hence, the probability of is f((x 1 + 

■ ■ ■ + XnY), when evaluated at Xi = pi for i = 1, 2, ... n, i.e., 



Vt = f((xi H Vx n Y) 



Xi—pi,i—l,...,7l- 



(20) 



Hence, d20]l and © lead to 

•£T(p, m ){z) = 53 / {{.Xi + \-X n Y) 2*| Xi=pi> i =1) 



t>0 



The identity 



r°° 1 

/ ^e~y d y = l 



and the linearity of the operator / imply that 



poo 

</>T(p,m)0)=/ 

J ° t>0 



/((^ + •••+£„)') 



z t y t e v dy 



/(E 



(.Tizy H h a;„zy)' 



e v dy 



f (exp(xizy 



■x n zy))e v dy (21) 



evaluated at x% = i = 1, . . . ,n. 

We next find the sum of the monomials in the polynomial 
expansion of cxp(xi + ■ ■ - + x n ) that should be removed under 
/. Clearly, this sum should be Y[7=i ( eXi ~ S mi (xi)), where 
S is defined in (0 and (O). Therefore, 

/ {exp{x\zy H h x n zy)) \ Xi = Pi ,i=i,...,n 

n 

= e** -l[(e^y - S mi ( Pi zy)) . 



<-PT{p,m){z) 



1> 



S rril (piZy)) 



e v dy 
(22) 



Proof of Corollary \3\ 
Note that 



t PT(p,m) ( z 



00 00 



J2 E Prob[r(p,m)=i]^ 

t=o j=t+l 

00 j— 1 

£Prob[:r(p,m)=j]£y 



t=o 



£?[T(p,m)] = ^jProb[T(p,m) = j] = Vr (p , m) (l). 
Similarly, 

00 

^T( P ,m)(^) = X>Prob[T(p,m) > t}^- 1 

t=0 

00 j—1 
= ^Prob[T(p,m)=j]^te t - 1 



t=o 



3 1 

¥>T( P ,m)(l) = E oJ'O' ~ l)Prob[T(p,m) = j]. 



3=1 



Hence, 



E[T(p,m) 2 } =^j 2 Prob[T(p,m) = j] 



and consequently, 

Var[T(p, m)] = 2^ (p m) (1) + ^ T(p , m) (1) - <p 2 T[p m) (1). 
We have 

<PT(p,m)( Z ) = 

-,(!-«) e-"^ 1 -*) - 5 mi _ 1 ( Pl xz)e-P^ 

n \ 
■ J] (e^w^ 1 - 2 ) - S mj {p d xz)e-^ x ) jdx, 
3=1 ' 
and from there, we can get <p' T ( p m )(l) an d Var[T(p, m)]. 

Proof of Theorem [4] 

We again apply the Newman-Shepp symbolic method. Similar 
to the proof of Theorem [2] we introduce an operator / acting 
on an n-variable polynomial g. For a monomial x™' 1 . . . xj£", 
let ij be the number of exponents w u among w\, . . . ,w n 
satisfying w u > kj, for j = I,..., A. f removes all 
monomials X™ 1 . . . x™ n in g satisfying i\ > k\, . . . ,%a > kA 
and i\ < ■ ■ ■ < ia- f is again a linear operator. One can see 
that 



<Pu(m,-k)(z) = 

f (exp(xizy H h x n zyj) e~ y dy\ Xi=X2= ... =Xn 



(23) 
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We choose integers = io < i\ < ■ ■ ■ < %a < «a+i = n, 
such that ij > kj for j = 1, . . . , A, and then partition indices 
{1, . . . , n} into (A + 1) subsets X\,... ,Ia+i, where = 
1, . . . , A + 1) has ij — elements. Then 



A+l 



(24) 



n n (^-i ~ ^ (- Tj; )) 

equals the sum of all monomials in exp(xi + • • • + x n ) with 
{ij — ij-i) °f the n exponents smaller than rrij-i but greater 
than or equal to rrij, for j = 1, . . . , A+l. (Here 5 is as defined 
by (|5]|6]l.) The number of such partitions of {1, . . . , n} is equal 
to f n ) = T\t n C 1+1 )- Finally, we need to sum 
the terms of the form (I24D over all partitions of all choices of 
ii, . . . , %a satisfying kj < ij < ij+i for j = 1, . . . , A: 

f (expOizy H + x n zy)) \ Xi= ... =Xti= _l = cxp(zy)- 

A ,. 



n 



(•o.'i.-iU+i) 1 j=0 

j = l,2,...,A 



S mj { ) S mj+1 { ) 
n n 



(25) 



Bringing (f25l into ( 1231 gives our result in Theorem [4] 

Appendix C 
Proof of Theorem[8] 



E[W(p, S )] 



Expression ( TT~8T > for E[W 2 (p 1 g)] can be derived in the same 
manner, and then the expression for War[W(p, g)] immediately 
follows. 

Appendix D 
Proof of Theorem[T"51 

Without loss of generality, let / = {1, 2, . . . , s} and j = 
s + 1, and define TZ a = Uf =1 i4 B s = Uf =1 B 4 , and Q a = 
U| =1 Gi for s = 0, 1, ... - 1. Then, £ [| (U ie/ Gi) n G,\] = 
E [\Q S n G s +i|]. For any two sets X and Y, we use X + Y 
to denote X l)Y when X n F = 0. 

Q a n G s+1 ={B S + K S \B S ) n (Ba+i + 

=B a n i? s+ i + n s n s s+ i + (K a \B a ) n i? s+1 , 

and therefore 

E[\G s nG s+1 \] =E[\B S D R s+1 \] + (27) 

£[|ft s n B a+1 \] + E[\(n s \B a ) n i? s+1 |]. 

Using Claim [TT] we have 

E[\B S nR s+1 \] = shir, (28) 
E[\H a nB a + 1 \] = h[l-(l-ir)'], (29) 
E[\(n a \B a ) n i? s+1 |] = (n - a - l)hw[l - (1 - ^) s ], (30) 

into 



m \i=l 



1 -[IE* = "^K 1 - SmtOHxje.-"*) 



i—1 rrii 



poo / n 



dx 
(26) 



-e-" x E Ml [S Mi (jHx)]) dx. 



comes from the distributivity. 
Since 

^ [S Mt (pix)] = £ ^^Pr[Mi > j], 
by Claim [U 

<S«(^) + f;^a M 5-^ 



S 9i (pix) + a q , gi q gi e piX/q - a q , gi q g <S gz (pix/q), 



where 



9.-1 

lnPr{A/( 3i , 9i ) < 9l } = - In (l - 9 

fc=0 



for i = 1, 2, . . . , n. 
Hence, we have (fTTI i. 



where 7r is as defined in Claim [TT] Bringing d28D - 
(l27l . we obtain (fT9l . 

Furthermore, when n — > oo, if l/h — > a and s/n —> f3, then 



E[\G S n G s .+ l|] = ff • [1 - 7f S ] + Sh ■ TTTT S 



>h(l + a 
hap(l - 



1-1- 



n - 1 



re/9 



n/3 



n-1 

(l + a)(l -e- Q ' 3 ) + a/3e- a ' 3 
l + a-(l + a- a/3)e" Q/3 
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